Total least squares based subband modelling for scalable speech representations with damped sinusoids

نویسندگان

  • Kris Hermus
  • Werner Verhelst
  • Patrick Wambacq
  • Philippe Lemmerling
چکیده

We describe how Total Least Squares (TLS) algorithms can be applied as a powerful and eÆcient modelling tool for wideband speech. A detailed description in both time domain and frequency domain illustrates how the modelling functions { damped sinusoids { naturally synthesise non-stationary signals. Straightforward implementations of TLS applied to fullband speech are known to be computationally hard and they can suffer from numerical sensitivity. In this paper we introduce a subband approach, which leads to a signi cant reduction of the computational load with an enhanced numerical stability. Moreover, it enables to control the distribution of the TLS components over the spectral range of the input signal such that perceptual criteria can be incorporated in the modelling scheme. We also address the scalability of our design from smallband speech to high quality audio, and provide evidence for the existence of coupled components in TLS modelled segments.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Perceptual audio modeling with exponentially damped sinusoids

This paper presents the derivation of a new perceptual model that represents speech and audio signals by a sum of exponentially damped sinusoids. Compared to a traditional sinusoidal model, the exponential sinusoidal model (ESM) is better suited to model transient segments that are readily found in audio signals. Total least squares (TLS) algorithms are applied for the automatic extraction of t...

متن کامل

Frequency and Damping Estimation Methods – an Overview

This overview paper presents and compares different methods traditionally used for estimating damped sinusoid parameters. Firstly, direct nonlinear least squares fitting the signal model in the time and frequency domains are described. Next, possible applications of the Hilbert transform for signal demodulation are presented. Then, a wide range of autoregressive modelling methods, valid for dam...

متن کامل

Speech synthesis using damped sinusoids.

A speech synthesizer was developed that operates by summing exponentially damped sinusoids at frequencies and amplitudes corresponding to peaks derived from the spectrum envelope of the speech signal. The spectrum analysis begins with the calculation of a smoothed Fourier spectrum. A masking threshold is then computed for each frame as the running average of spectral amplitudes over an 800-Hz w...

متن کامل

ACOUSTICS2008/2066 Damped sinusoids and subspace based approach for lossy audio coding

The new subspace-based techniques recently introduced appear to be well adapted for the parameters estimation of a damped sinusoids + noise signal model. These High-Resolution (HR) methods have a better frequency resolution than the Fourier analysis, but they are rarely used in audio coding. Although HR methods would be suitable for parametric coding at low bitrates, we show that they are also ...

متن کامل

Development of high quality acoustic subband echo canceller using dual-filter structure and fast recursive least squares algorithm

A high quality acoustic subband echo canceller is developed based on a dual-filter structure and the fast recursive least squares (FRLS) algorithm. Methods for overcoming the instability problem of the FRLS algorithm and implementing it using the 32-bit fixed-point arithmetic are presented. A new tap-weight transfer method, which assures double talk detection, is proposed. Computer simulations ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000